Detecting and Monitoring Bias for Subgroups in Breast Cancer Detection AI

arXiv.org Artificial Intelligence

Early breast cancer detection (BCD) through mammography screening continues to be a major focus in radiology as it plays a critical role in reducing mortality rates (Coleman (2017); Ginsburg et al. (2020)). Although artificial intelligence (AI) models can help radiologists to evaluate mammograms (Sahu et al. (2023); Evans et al. (2013); Maxwell (1999)), training such models faces the challenge of limited datasets that may not fully represent all subgroups or cover variations in data distributions. Historically, certain racial groups have faced barriers to healthcare access because of many socio-economic factors (Azin et al. (2023); Hershman et al. (2005); Hussain-Gambles et al. (2004)). This lack of access can result in datasets that do not adequately represent these groups, potentially causing AI models to show biases for these groups. Even with seemingly balanced datasets, subtle biases may persist in the collected data due to systemic inequalities in the quality of healthcare (Obermeyer et al. (2019)). Among these groups, African American patients are often underrepresented in both breast imaging and broader healthcare datasets (Yedjou et al. (2019); Newman and Kaljee (2017)).


What is in a name? Mitigating Name Bias in Text Embeddings via Anonymization

arXiv.org Artificial Intelligence

Text-embedding models often exhibit biases arising from the data on which they are trained. In this paper, we examine a hitherto unexplored bias in text-embeddings: bias arising from the presence of $\textit{names}$ of persons, locations, organizations, etc. in the text. Our study shows how the presence of $\textit{name-bias}$ in text-embedding models can potentially lead to erroneous conclusions in the assessment of thematic similarity. Text-embeddings can mistakenly indicate similarity between texts based on names in the text, even when their actual semantic content has no similarity, or indicate dissimilarity simply because of the names in the text, even when the texts match semantically. We first demonstrate the presence of name bias in different text-embedding models and then propose $\textit{text-anonymization}$ during inference, which involves removing references to names while preserving the core theme of the text. The efficacy of the anonymization approach is demonstrated on two downstream NLP tasks, achieving significant performance gains. Our simple and training-optimization-free approach offers a practical and easily implementable solution to mitigate name bias.
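To make the idea of inference-time anonymization concrete, here is a minimal sketch. The abstract does not say how names are detected or what they are replaced with, so the function `anonymize`, the `[NAME]` placeholder, and the assumption that a list of names is already available (for example from a named-entity recogniser) are all hypothetical choices for illustration, not the authors' method.

```python
import re


def anonymize(text, names, placeholder="[NAME]"):
    """Replace every listed name in `text` with a neutral placeholder.

    `names` is assumed to come from some upstream name detector;
    this sketch does not detect names itself.
    """
    if not names:
        return text
    # Try longer names first so "New York City" wins over "New York".
    ordered = sorted(names, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(name) for name in ordered) + r")\b"
    )
    return pattern.sub(placeholder, text)


original = "Alice moved from Paris to work at Acme."
masked = anonymize(original, ["Alice", "Paris", "Acme"])
print(masked)
```

The masked text, rather than the original, would then be passed to the embedding model, so that two texts are compared on their theme rather than on the names they happen to share.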


Robust image representations with counterfactual contrastive learning

arXiv.org Artificial Intelligence

Contrastive pretraining can substantially increase model generalisation and downstream performance. However, the quality of the learned representations is highly dependent on the data augmentation strategy applied to generate positive pairs. Positive contrastive pairs should preserve semantic meaning while discarding unwanted variations related to the data acquisition domain. Traditional contrastive pipelines attempt to simulate domain shifts through pre-defined generic image transformations. However, these do not always mimic realistic and relevant domain variations for medical imaging such as scanner differences. To tackle this issue, we herein introduce counterfactual contrastive learning, a novel framework leveraging recent advances in causal image synthesis to create contrastive positive pairs that faithfully capture relevant domain variations. Our method, evaluated across five datasets encompassing both chest radiography and mammography data, for two established contrastive objectives (SimCLR and DINO-v2), outperforms standard contrastive learning in terms of robustness to acquisition shift. Notably, counterfactual contrastive learning achieves superior downstream performance on both in-distribution and external datasets, especially for images acquired with scanners under-represented in the training set. Further experiments show that the proposed framework extends beyond acquisition shifts, with models trained with counterfactual contrastive learning substantially improving subgroup performance across biological sex.


NoPhish: Efficient Chrome Extension for Phishing Detection Using Machine Learning Techniques

arXiv.org Artificial Intelligence

The growth of digitalization services via web browsers has simplified our daily routine of doing business. But at the same time, it has made the web browser very attractive for several cyberattacks. Web phishing is a well-known cyberattack in which attackers masquerade as trustworthy web servers to obtain sensitive user information such as credit card numbers, bank information, personal IDs, social security numbers, and usernames and passwords. In recent years many techniques have been developed to verify the authenticity of the web pages that users visit and warn them when a webpage is a phishing page. In this paper, we have developed an extension for Chrome, the most popular web browser, that will serve as middleware between the user and phishing websites. The Chrome extension, named "NoPhish", identifies a phishing webpage based on several Machine Learning techniques. We have used the training dataset from "PhishTank" and extracted the 22 most popular features as rated by the Alexa database. The training algorithms used are Random Forest, Support Vector Machine, and k-Nearest Neighbor. The performance results show that Random Forest delivers the best precision.


Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

arXiv.org Artificial Intelligence

With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically related examples as demonstrations. While this approach yields more accurate results, its robustness against various types of adversarial attacks, including perturbations on test samples, demonstrations, and retrieved data, remains under-explored. Our study reveals that retrieval-augmented models can enhance robustness against test sample attacks, outperforming vanilla ICL with a 4.87% reduction in Attack Success Rate (ASR); however, they exhibit overconfidence in the demonstrations, leading to a 2% increase in ASR for demonstration attacks. Adversarial training can help improve the robustness of ICL methods to adversarial attacks; however, such a training scheme can be too costly in the context of LLMs. As an alternative, we introduce an effective training-free adversarial defence method, DARD, which enriches the example pool with those attacked samples. We show that DARD yields improvements in performance and robustness, achieving a 15% reduction in ASR over the baselines. Code and data are released to encourage further research: https://github.com/simonucl/adv-retreival-icl
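The Attack Success Rate figures quoted above can be read with the following minimal sketch in mind. The abstract does not define ASR, so this assumes one common convention: among the examples the model classifies correctly before the attack, ASR is the fraction whose prediction becomes wrong after the attack. The function name and the toy data are hypothetical.

```python
def attack_success_rate(labels, clean_preds, attacked_preds):
    """Fraction of initially correct predictions that the attack flips to wrong.

    Assumed convention: examples the model already got wrong on clean
    input are excluded, since the attack cannot "succeed" on them.
    """
    correct_before = [
        i for i, (y, p) in enumerate(zip(labels, clean_preds)) if y == p
    ]
    if not correct_before:
        return 0.0
    flipped = sum(1 for i in correct_before if attacked_preds[i] != labels[i])
    return flipped / len(correct_before)


# Toy data: four examples, three correct on clean input, one of those flipped.
labels = [1, 0, 1, 1]
clean_preds = [1, 0, 0, 1]
attacked_preds = [0, 0, 0, 1]
asr = attack_success_rate(labels, clean_preds, attacked_preds)
print(f"ASR = {asr:.3f}")
```

Under this convention a lower ASR means a more robust model, which is the sense in which the reductions reported in the abstract are improvements.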


From the evolution of public data ecosystems to the evolving horizons of the forward-looking intelligent public data ecosystem empowered by emerging technologies

arXiv.org Artificial Intelligence

Public data ecosystems (PDEs) represent complex socio-technical systems crucial for optimizing data use in the public sector and outside it. Recognizing their multifaceted nature, previous research proposed a six-generation Evolutionary Model of Public Data Ecosystems (EMPDE). Designed as a result of a systematic literature review on the topic spanning three decades, this model, while theoretically robust, necessitates empirical validation to enhance its practical applicability. This study addresses this gap by validating the theoretical model through a real-life examination in five European countries - Latvia, Serbia, Czech Republic, Spain, and Poland. This empirical validation provides insights into PDE dynamics and variations of implementations across contexts, particularly focusing on the 6th, forward-looking generation of PDEs, named "Intelligent Public Data Generation", which represents a paradigm shift driven by emerging technologies such as cloud computing, Artificial Intelligence, Natural Language Processing tools, Generative AI, and Large Language Models (LLM) with the potential to contribute to both automation and augmentation of business processes within these ecosystems. By transcending their traditional status as a mere component, evolving into both an actor and a stakeholder simultaneously, these technologies catalyze innovation and progress, enhancing PDE management strategies to align with societal, regulatory, and technical imperatives in the digital era.


Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

arXiv.org Artificial Intelligence

Event temporal graphs have been shown to be convenient and effective representations of complex temporal relations between events in text. Recent studies, which employ pre-trained language models to auto-regressively generate linearised graphs for constructing event temporal graphs, have shown promising results. However, these methods have often led to suboptimal graph generation as the linearised graphs exhibit set characteristics which are instead treated sequentially by language models. This discrepancy stems from the conventional text generation objectives, leading to erroneous penalisation of correct predictions caused by the misalignment of elements in target sequences. To address these challenges, we reframe the task as a conditional set generation problem, proposing a Set-aligning Framework tailored for the effective utilisation of Large Language Models (LLMs). The framework incorporates data augmentations and set-property regularisations designed to alleviate text generation loss penalties associated with the linearised graph edge sequences, thus encouraging the generation of more relation edges. Experimental results show that our framework surpasses existing baselines for event temporal graph generation. Furthermore, under zero-shot settings, the structural knowledge introduced through our framework notably improves model generalisation, particularly when the training examples available are limited.
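The misalignment problem described above can be illustrated with a small sketch. It is not the paper's training objective or its regularisers; it only contrasts a position-by-position comparison of linearised edges with a set-based comparison. The edge format `(head, relation, tail)` and both function names are assumptions made for this example.

```python
def positional_matches(predicted, target):
    """Count edges that agree at the same position in the linearised sequence."""
    return sum(1 for p, t in zip(predicted, target) if p == t)


def set_f1(predicted, target):
    """F1 between predicted and target edges, ignoring their order."""
    predicted_set, target_set = set(predicted), set(target)
    overlap = len(predicted_set & target_set)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted_set)
    recall = overlap / len(target_set)
    return 2 * precision * recall / (precision + recall)


target = [("arrest", "before", "trial"), ("trial", "before", "verdict")]
# Same edges as the target, generated in a different order.
predicted = [("trial", "before", "verdict"), ("arrest", "before", "trial")]

print(positional_matches(predicted, target))  # sequence view: nothing lines up
print(set_f1(predicted, target))              # set view: the graphs are identical
```

Here the predicted graph is correct, yet a sequence-aligned comparison credits none of its edges, which is the kind of penalty a set-oriented formulation is meant to avoid.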


Counterfactual contrastive learning: robust representations via causal image synthesis

arXiv.org Artificial Intelligence

Contrastive pretraining is well-known to improve downstream task performance and model generalisation, especially in limited label settings. However, it is sensitive to the choice of augmentation pipeline. Positive pairs should preserve semantic information while destroying domain-specific information. Standard augmentation pipelines emulate domain-specific changes with pre-defined photometric transformations, but what if we could simulate realistic domain changes instead? In this work, we show how to utilise recent progress in counterfactual image generation to this effect. We propose CF-SimCLR, a counterfactual contrastive learning approach which leverages approximate counterfactual inference for positive pair creation. Comprehensive evaluation across five datasets, on chest radiography and mammography, demonstrates that CF-SimCLR substantially improves robustness to acquisition shift with higher downstream performance on both in- and out-of-distribution data, particularly for domains which are under-represented during training.
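As a rough illustration of where the positive pair enters a SimCLR-style objective, the sketch below computes an InfoNCE-style loss for a single anchor embedding. It is a simplified, assumption-laden example: real SimCLR training applies the loss symmetrically over a whole batch, the temperature value is arbitrary, and the embeddings here are hand-written toy vectors rather than encoder outputs. In the counterfactual setting described above, `positive` would be the embedding of a counterfactual image of the same subject in another acquisition domain rather than of a photometric augmentation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.5):
    """InfoNCE-style loss for one anchor: low when the anchor is closer
    to its positive than to any of the negatives."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


anchor = [1.0, 0.0]
# Positive aligned with the anchor, negative orthogonal to it.
loss_aligned = info_nce(anchor, [1.0, 0.0], [[0.0, 1.0]])
# Positive orthogonal to the anchor, negative aligned with it.
loss_misaligned = info_nce(anchor, [0.0, 1.0], [[1.0, 0.0]])
print(loss_aligned < loss_misaligned)
```

Minimising this loss pulls the anchor towards whatever is supplied as the positive, which is why the choice of positive pair determines which variations the representation learns to ignore.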


Distributed AI in Zero-touch Provisioning for Edge Networks: Challenges and Research Directions

arXiv.org Artificial Intelligence

Zero-touch networks are anticipated to inaugurate a generation of intelligent and highly flexible resource provisioning strategies where multiple service providers collaboratively offer computation and storage resources. This transformation presents substantial challenges to network administration and service providers regarding sustainability and scalability. This article combines Distributed Artificial Intelligence (DAI) with Zero-touch Provisioning (ZTP) for edge networks. This combination helps to manage network devices seamlessly and intelligently by minimizing human intervention. In addition, we highlight several advantages of incorporating Distributed AI into ZTP in the context of edge networks. Further, we draw potential research directions to foster novel studies in this field and overcome the current limitations.


Full text: NATO Vilnius summit communique

Al Jazeera

NATO leaders are holding their annual summit as Ukraine looks to the security alliance for support in its attempt to push back invading Russian forces. The Vilnius communique, however, while emphasising NATO's support for Ukraine, gave no clear timetable on when the country might be able to join the alliance, in a major disappointment for Ukrainian President Volodymyr Zelenskyy, who had travelled to the Lithuanian capital. "Ukraine's future is in NATO," the leaders said in the joint statement on Tuesday. "We will be in a position to extend an invitation to Ukraine to join the alliance when allies agree and conditions are met," the declaration said, without specifying the conditions. The communique also touched on the Asia Pacific, with the leaders of Australia, Japan, New Zealand and South Korea all attending as NATO allies. It said China was a challenge to NATO's interests, security and values with its "ambitions and coercive policies", triggering a furious response from Beijing. And it accused Beijing and Moscow of "mutually reinforcing attempts to undercut the rules-based international order". China has said it wants peace in Ukraine, but has not condemned Russia's full-scale invasion since it began in February 2022.
NATO is a defensive Alliance. It is the unique, essential and indispensable transatlantic forum to consult, coordinate and act on all matters related to our individual and collective security. We reaffirm our iron-clad commitment to defend each other and every inch of Allied territory at all times, protect our one billion citizens, and safeguard our freedom and democracy, in accordance with Article 5 of the Washington Treaty. We will continue to ensure our collective defence from all threats, no matter where they stem from, based on a 360-degree approach, to fulfil NATO's three core tasks of deterrence and defence, crisis prevention and management, and cooperative security.
We adhere to international law and to the purposes and principles of the Charter of the United Nations and are committed to upholding the rules-based international order. This Summit marks a milestone in strengthening our Alliance. We look forward to our valuable exchanges with the Heads of State and Government of Australia, Japan, New Zealand, and the Republic of Korea, as well as the President of the European Council and the President of the European Commission at this Summit. We also welcome the engagements with the Foreign Ministers of Georgia and the Republic of Moldova, and with the Deputy Foreign Minister of Bosnia and Herzegovina, as we continue to consult closely on the implementation of NATO's tailored support measures. This is an historic step for Finland and for NATO. For many years, we worked closely as partners; we now stand together as Allies. NATO membership makes Finland safer, and NATO stronger. Every nation has the right to choose its own security arrangements.